AM-FM based filter bank analysis for estimation of spectro-temporal envelopes and its application for speaker recognition in noisy reverberant environments

نویسندگان

  • Dhananjaya N. Gowda
  • Rahim Saeidi
  • Paavo Alku
چکیده

In this paper, a new AM-FM based filter bank analysis for the estimation of spectro-temporal envelope (STE) of speech signals is proposed. The filter bank is simulated by filtering a frequency translated signal using a single resonator centered around the Nyquist frequency. The proposed design of using a single fixed resonator provides distinct advantages over the traditional methods of filter bank design. First, it provides a simple IIR filter with a smooth frequency response with no ripples. Second, the bandwidth of the resonator can be easily controlled by the multiplicity of poles and their proximity to the unit circle on the z-plane. Third, the resonator fixed at the highest possible center frequency provides the best separation between the AM and FM components of the filtered signal. Speaker recognition experiments on noisy and reverberant speech with short test segments show that the proposed AM-FM based filter bank analysis for STE estimation provides consistent improvement over a recently proposed discrete cosine transform based filter bank approach.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Concurrent speaker localization using multi-band position-pitch (m-popi) algorithm with spectro-temporal pre-processing

Accurate, microphone-based speaker localization in real-world environments, like office spaces or meeting rooms, must be able to track a single speaker and multiple concurrent speakers in the presence of reverberations and background noise. Our Multiband Joint Position-Pitch (M-PoPi) algorithm for circular microphone arrays already shows a frame-wise localization estimation score of about 95% f...

متن کامل

Spectro-temporal processing for blind estimation of reverberation time and single-ended quality measurement of reverberant speech

Auditory spectro-temporal representations of reverberant speech are investigated for blind estimation of reverberation time (RT ) and for single-ended measurement of speech quality. The auditory representations are obtained from an eight-filter filterbank which is used to extract the modulation spectra from temporal envelopes of the speech signal. Gaussian mixture models (GMM), one for each mod...

متن کامل

Robust speaker recognition using spectro-temporal autoregressive models

Speaker recognition in noisy environments is challenging when there is a mis-match in the data used for enrollment and verification. In this paper, we propose a robust feature extraction scheme based on spectro-temporal modulation filtering using two-dimensional (2-D) autoregressive (AR) models. The first step is the AR modeling of the sub-band temporal envelopes by the application of the linea...

متن کامل

Separable spectro-temporal Gabor filter bank features: Reducing the complexity of robust features for automatic speech recognition.

To test if simultaneous spectral and temporal processing is required to extract robust features for automatic speech recognition (ASR), the robust spectro-temporal two-dimensional-Gabor filter bank (GBFB) front-end from Schädler, Meyer, and Kollmeier [J. Acoust. Soc. Am. 131, 4134-4151 (2012)] was de-composed into a spectral one-dimensional-Gabor filter bank and a temporal one-dimensional-Gabor...

متن کامل

Verified speaker localization utilizing voicing level in split-bands

This paper proposes a joint verification-localization structure based on split-band analysis of speech signal and the mixed voicing level. To address the problems in reverberant acoustic environments, a new fundamental frequency estimation algorithm is proposed based on high resolution spectral estimation. In the reconstruction of the distorted speech this information is utilized to reduce the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015